-
Notifications
You must be signed in to change notification settings - Fork 78
Clarify BIDS dataset organization and distribution considerations #688
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @yarikoptic, thanks for putting these changes together as it clarifies several pieces for me.
cc @satra
We may also need to change the above statement to more accurately reflect the examples provided. Perhaps to the following:
|
- Add note about BIDS Raw datasets being distributable without derivatives - Include dataset_description.json in directory structure examples to emphasize where we observe legit BIDS datasets - Explain disadvantages of nested dataset organization for distribution - Clarify that sourcedata can contain Raw, non-BIDS, or derivative datasets - Add requirement for BIDSVersion key to identify BIDS datasets in subdirectories - Re-Include example of non-nested dataset organization in my_study folder (I based this change on top of the removal proposal in bids-standard#687)
Co-authored-by: Kabilar Gunalan <[email protected]>
852dcb2 to
32e5edb
Compare
effigies
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
WDYT?
julia-pfarr
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know, I don't want to initiate a new discussion (or this discussion again...) but I kind of don't understand why we need to serve all these different cases of where to store what?
Why can't we just say
root/sourcedata--> out of scanner data
root/raw--> raw data in BIDS
root/derivatives--> anything starting with preprocessing, in BIDS- If you have only one of those three, adding a second level folder is not necessary.
Can someone please explain to me why it is not possible to do this? I'm clearly missing something and I would like to understand...sorry!
|
@julia-pfarr a quick answer on "Why can't we just say" is that because it is not what the standard is permitting/expecting ATM. This repository is for the website which just provides a little more of "expanded for a user re-digestion of the specification". If we start saying here something not allowed by the specification, we would do disservice IMHO. Any decisions to change things up should be discussed against specification first. |
TODO: add generic phrasing based on what @effigies suggested Co-authored-by: Chris Markiewicz <[email protected]> Co-authored-by: Julia-Katharina Pfarr <[email protected]>
Ok, I see, that makes sense, thanks! So I understand from your answer that my suggestion is technically possible and not totally unreasonable, this is just not the right place for it. |
|
Hi @yarikoptic, the conversation above was resolved, but I don't think the following update to the text has been made. Original: Suggestion: |
Co-authored-by: Kabilar Gunalan <[email protected]>
|
@yarikoptic I fixed my suggestion, wdyt? Should I merge or do you want to change something? |
to emphasize where we observe legit BIDS datasets
This PR is on top of the
rawdata/at the top-level of a BIDS dataset #687by @kabilar
Potential TODOs but may be after in a separate PR: